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ABSTRACT Activation-induced cytidine deaminase (AID) mediates the somatic hypermutation (SHM) of immunoglobulin (Ig) 
variable (V) regions that is required for the generation of antibody diversity and for the affinity maturation of the antibody re- 
sponse against infectious agents and toxic substances. AID preferentially targets WRC (W = A/T, R = A/G) hot spot motifs, par- 
ticularly WGCW motifs that create overlapping hot spots on both strands. In order to gain a better understanding of the genera- 
tion of antibody diversity and to create a platform for the in vitro generation of affinity-matured antibodies, we have established 
a system involving recombinase-mediated cassette exchange (RMCE) to replace the V region and its flanking sequences. This 
makes it possible to easily manipulate the sequence of the Ig gene within the endogenous heavy chain of the Ramos human 
Burkitt's lymphoma cell line. Here we show that the newly integrated wild-type (WT) VH regions introduced by RMCE undergo 
SHM similarly to non-RMCE-modified Ramos cells. Most importantly, we have shown that introducing a cluster of WGCW mo- 
tifs into the complementary determining region 2 (CDR2) of the human heavy chain V region significantly raised the mutation 
frequency and number of mutations per sequence compared to WT controls. Thus, we have demonstrated a novel platform in 
Ramos cells whereby we can easily and quickly manipulate the endogenous human VH region to further explore the regulation 
and targeting of SHM. This platform will be useful for generating human antibodies with changes in affinity and specificity in 
vitro. 

IMPORTANCE An effective immune response requires a highly diverse repertoire of affinity-matured antibodies. Activation- 
induced cytidine deaminase (AID) is required for somatic hypermutation (SHM) of immunoglobulin (Ig) genes. Although a 
great deal has been learned about the regulation of AID, it remains unclear how it is preferentially targeted to particular motifs, 
to certain locations within the Ig gene and not to other highly expressed genes in the germinal center B cell. This is an important 
question because AID is highly mutagenic and is sometimes mistargeted to other highly expressed genes, including proto- 
oncogenes, leading to B cell lymphomas. Here we describe how we utilize recombinase-mediated cassette exchange (RMCE) to 
modify the sequence of the endogenous heavy chain locus in the Ramos Burkitt's lymphoma cell line. This platform can be used 
to explore the regulation and targeting of SHM and to generate human antibodies with changes in affinity and specificity in 
vitro. 
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An effective immune response against infectious agents and 
toxic substances requires the generation of a highly diverse 
repertoire of antibodies that bind with strong affinity. Initially 
antibodies are diversified by the combinatorial rearrangement of 
germ line immunoglobulin (Ig) variable (V), diversity (D), and 
joining (J) gene segments to create the heavy (H) and light (L) 
chain V regions. As B cells interact with antigen and differentiate 
into antibody-secreting cells, these rearranged V regions undergo 
a further diversification process called somatic hypermutation 
(SHM) to produce higher-affinity antibodies that may also have 
changes in fine specificity ( 1 ). While much has been learned about 
the generation of a diverse and high-affinity repertoire, many de- 
tails of how this process is regulated and targeted to antibody 
genes are still poorly understood (2). To achieve a better under- 



standing of this highly mutagenic process, it would be useful to 
have new platforms to study the generation of antibody diversity 
and the affinity maturation of the immune response. Such plat- 
forms would also be useful in generating better monoclonal anti- 
bodies for solving many scientific questions and to diagnose and 
treat disease. 

Activation-induced cytidine deaminase (AID) is required for 
and initiates SHM (3, 4) by deaminating dC residues of single- 
stranded DNA (ssDNA) within transcription bubbles, creating a 
guanine-uracil (G-U) mismatch. These mismatched bases are ei- 
ther processed during replication, resulting in C-to-T transitions; 
removed by uracil-DNA glycosylase (UNG) during short-patch 
base excision repair (BER); or excised by mismatch repair 
(MMR), where the excised DNA strand is resynthesized in an 
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error-prone manner, creating additional mutations (2, 5, 6). AID 
preferentially targets cytosines for deamination in WRC hot spot 
(HS) motifs (7-10), but very little is known about exactly how this 
targeting is achieved or why the V regions and not the C region 
and most other non-Ig genes are not usually targeted (11-13). 
One way to examine the mechanisms and identify the factors re- 
sponsible for the preferential targeting of AID would be to change 
the sequences in the Ig V region and its immediate flanking se- 
quences and regulatory regions. This is preferably done within the 
endogenous locus, since all of the as-acting sequences required for 
this process have yet to be determined. Antibody-forming B cell 
lines in tissue culture allow the examination of multiple different 
engineered versions of parts of the Ig gene, but such experiments 
require the repeated replacement of the wild-type ( WT) Ig gene by 
the modified H or L chain genes (14). Currently, this is most often 
done using the DT40 cell line, which has a high frequency of ho- 
mologous recombination (15, 16). However, DT40 is a chicken B 
cell line where the somatic V region mutation process coexists 
with gene conversion. Even though the competition between 
SHM and gene conversion can be overcome using genetically 
modified versions of DT40 (17-19), these cells do not express the 
second phase of SHM involving BER and MMR that is responsible 
for the acquisition of large numbers of mutations in A-T residues, 
and in this and probably other ways, the mutational process in 
DT40 cells differs from that which occurs in mouse and human B 
cells (20,21). 

The frequency of homologous recombination in the IgH locus 
in centroblast-like human Epstein-Barr virus-negative (EBV~) 
Burkitt's lymphoma cell lines is quite high (22), but these cell lines 
would be even more useful if the replacements could be restricted 
to the rearranged IgH chain locus in a way that favored the re- 
placement of only the endogenous gene and the insertion of the 
transfected gene in the correct orientation. Here we describe the 
use of the recombinase-mediated cassette exchange (RMCE) 
strategy (23-26) as a technique that can be used to easily manip- 
ulate the sequence of the Ig gene within the endogenous locus of 
the Ramos human Burkitt's lymphoma B cell line. Ramos cells are 
an excellent cell line for this system because they constitutively 
express AID and undergo SHM in tissue culture (27-30), are easily 
transfectable, have only one rearranged IgH chain V region, ex- 
press large amounts of surface IgMA, and secrete only small 
amounts of antibody. It has already been shown that these prop- 
erties allow the use of fluorescence-activated cell sorting of Ramos 
cells to sequentially select for rare cells, making variants of the 
endogenous Ramos antibody that have a higher affinity and 
changes in specificity and thus can mimic the affinity maturation 
of normal human antibodies (31). This suggests that if the Ramos 
IgH V regions could be readily replaced by other V regions, this 
platform could be used to affinity mature and change the fine 
specificity of many different monoclonal antibodies. 

Here we describe how to use RMCE to replace the endogenous 
IgH V region, and we show that the newly inserted wild-type V 
region undergoes SHM at approximately the same rate and with 
the same characteristics as that in the original wild-type Ramos 
cells. By engineering a replacement construct with multiple clon- 
ing sites, we can insert any V region in place of the endogenous 
IgH V region and easily manipulate the sequence of the V region as 
well as surrounding cis regulatory sequences at the endogenous 
locus. We also demonstrate that we can increase the relatively low 
spontaneous rate of mutation seen in all cultured cells (32) by 



introducing a cluster of hot spots into the highly mutable comple- 
mentary determining region 2 (CDR2) of the V region without 
changing the distribution or characteristics of the mutational pro- 
cess. This indicates that the Ramos cells provide a useful platform 
for examining the role of the V region sequence environment and 
of ris-acting elements and for affinity maturing of human anti- 
bodies to any antigen. 

RESULTS 

Replacing the endogenous IgH VDJ and flanking regions of 
Ramos cells with Hyg-TK. To establish RMCE (23) in Ramos 
cells, we created a number of constructs that could be homolo- 
gously recombined into the endogenous H chain locus. First, a 
targeting construct (Fig. 1A) was generated that contained 2.2 kb 
of sequence 5' to the promoter of the endogenous rearranged VH 
4-34 gene (5' homologous arm), a lox site (lox), the hygromycin 
and thymidine kinase resistance genes (Hyg-TK) driven by a cy- 
tomegalovirus (CMV) promoter, a second lox (lox) site that dif- 
fers from the first lox site to facilitate directional integration, and 
5.8 kb of sequences 3' to the /jl switch region that include all of the 
IL constant region and the membrane and secretory exons (3' 
homologous arm). When the targeting construct was transfected 
into surface IgM-positive Ramos cells, -2% (5/235) of the trans- 
fected cells were hygromycin resistant and surface IgM negative. 
Since they had lost their surface IgM, it was likely that they had 
undergone homologous replacement of the H chain V, E/x, or S/jl 
(Fig. 1 A) . To confirm this, all five of the independently transfected 
IgM - hygromycin-resistant clones were analyzed by Southern 
analysis and compared to the original unmodified Ramos clone 
(labeled "C" for control) (Fig. IB). This revealed that the produc- 
tive endogenous IgH gene had been disrupted while the other 
Ramos endogenous IgM gene that had previously undergone 
translocation with Myc was still present. Using primer PI, located 
outside the targeting construct, and primer P2, located within the 
targeting construct (Fig. 1A), PCR analysis of genomic DNA from 
each of the 5 independently transfected clones further confirmed 
that the Hyg-TK gene was present and in the correct orientation 
(Fig. 1C). The presence of the Hyg-TK gene and the loss of the 
productive ;u. gene established that the Hyg-TK gene had homolo- 
gously recombined into the endogenous productive H chain locus 
and that the targeting construct was in the correct orientation. 

Cre-mediated replacement of the Hyg-TK with the Ramos V 
region. Next, we transfected a wild-type replacement construct 
containing lox (2L)-VDJ-E/a-S/a-1ox (L3) (WT A) (Fig. 2) into the 
Hyg-TK-expressing cells (Hyg-TK) along with a Cre expression 
plasmid and allowed it to recombine at the lox sites (primary 
replacement of Hyg-TK). The TK gene was present so that we 
could have counterselected with ganciclovir for clones in which 
the Hyg-TK was replaced (23) . However, we found that the Ramos 
cells were very sensitive to ganciclovir, and it was difficult to find a 
dose that did not kill some of the TK-negative cells. Since in this 
case the clones that had successfully integrated the replacement 
vector expressed surface IgM, recovering IgM + cells by 
fluorescence-activated cell sorting provided an easy and reliable 
counterselection in which virtually all of the counterselected cells 
expressed the replacement vector. We therefore elected to isolate 
integrated clones by sorting for IgM and cloning IgM + cells in soft 
agar. The isolation and subsequent characterization (see below) of 
these IgM + cells demonstrated that the endogenous H chain gene 
could be easily restored in Ramos cells. 
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FIG 1 Schematic of the recombinase-mediated cassette exchange strategy. (A) Homologous recombination to create the Hyg-TK Ramos cell line. A schematic 
of the productive endogenous IgH allele includes the leader exon (L), the variable region (VDI), E/j,, S/x, and Cju,. The 5' and 3' homologous arms, the lox sites, 
and the location of the Hyg-TK gene are depicted in the targeting construct. (B) Southern blot of 5 independently isolated Hyg-TK RMCE clones and a control 
(lane C) IgM + wild-type non-RMCE Ramos clone. DNA was digested with NotI and Nsil, shown in panel A, and hybridized with a Cfi probe depicted by the 
black diagonal line in panel A. (C) PCR analysis using genomic DNA of the same clones described in panel B using primers PI and P2 shown in panel A. 



In order to determine whether these newly created WT A 
RMCE clones could undergo a second round of replacement, we 
transfected still another replacement construct that contains a 
C-to-G mutation at position 237 located within framework 2, cre- 
ating a nonsense mutation (WT B) (Fig. 2 and 3 A) into the WT A 
clones along with a Cre expression plasmid and allowed it to re- 
combine at the lox sites. In this case, the successfully integrated 
clones did not express surface IgM due to their nonsense codon, so 
we recovered IgM - cells by fluorescence-activated cell sorting. 
The successful replacement of the WT A insert by WT B demon- 
strated that a secondary replacement of the endogenous H chain 
gene could be achieved in Ramos cells that had already undergone 
one replacement. Independently derived WT A and WT B clones 
were each subcloned, and the WT A and B subclones were found 
to express similar levels of AID transcripts (Fig. 3B). 

RMCE clones undergo SHM with WT frequency and charac- 
teristics. To confirm that cassette exchange would allow a re- 
placed gene to function like the WT gene, we grew constitutively 
mutating Ramos clones with the newly inserted V regions for 
1.5 months to accumulate a sufficient number of mutations and 



sequenced random V regions as we have done in the past (30, 33). 
Ramos cells have only one rearranged V region, allowing us to 
amplify only the V region that was successfully integrated in the 
correct location using primers that anneal upstream of the first lox 
site and within JH6 (Fig. 3A, primers P3 and P4). Since it is pos- 
sible that some mutations could have arisen during the cassette 
exchange (34), cells were collected immediately following sub- 
cloning and their V regions were sequenced and found to not 
contain new mutations. Additionally, only unique mutations were 
scored in order to eliminate the possibility that "founder muta- 
tions" that arose before the start of the experiment would bias the 
results, although this results in the loss of some mutations which 
occur recurrently in hot spots and thus leads to an underestimate 
of the frequency of mutation. Sequence analysis of random V re- 
gions (between primers P3 and P4 shown in Fig. 3A; also Table 1 ) 
from the progeny of Ramos subclones WT A and WT B undergo- 
ing constitutive SHM revealed no significant differences (P = 
0.6109, x 2 test) in the frequency of unique mutations between the 
two sequentially replaced V regions (Fig. 3C; Table 1). This 
showed that, as reported in other systems (35), the small amount 
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FIG 2 Schematic of the Cre-mediated recombination to create the replaced IgH at the endogenous locus. The pentagon represents the addition of Cre, and the 
arrows demonstrate the transfection of WT A to create the primary replacement and the transfection of WT B to create the secondary replacement. The asterisk 
indicates the C-to-G mutation within VDI of WT B. 
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FIG 3 Insertion of a cluster of hot spots into the endogenous heavy chain gene of Ramos cells increases V region mutation. (A) Schematic of the replaced IgH 
locus with primer pairs used for amplification and sequencing indicated by arrows (P3 to P8). (B) Relative AID transcript levels in the various subclones shown 
in the bar graph with standard deviations. (C) A bar graph showing the frequencies of mutations depicted in Table 1 from the promoter, V, and constant regions. 
Error bars represent the standard deviations between 2 clones. (D) Pie charts representing the number of mutations per sequence; the number of sequences 
analyzed is depicted in the center of each circle. 



of nonsense-mediated decay induced by a nonsense mutation in 
WT B that was close to the promoter did not affect the rate of V 
region mutation (Fig. 3C). The newly inserted WT A and WT B V 
regions underwent a similar frequency of mutation as did the orig- 
inal parental wild-type Ramos cells (29, 30, 33) that were used to 
create the RMCE clones. 

To determine whether the RMCE process mistargets AID to 
the regions of the Ig gene flanking V which normally do not un- 
dergo SHM (11, 12, 36), we have also sequenced the promoter 
region from the 2L site to the leader exon (between primers P3 and 
P8) as well as the functional Cju. (37) from the CHI exon to the 
CH2 intron (Fig. 3A, between primers P5 and P6). Although we 
detected a few mutations (0.8 X 10" 4 to 1.3 X 10" 4 mutations/ 



bp) in the promoter and constant regions (Fig. 3C; Table 1), the 
mutation frequency was statistically significantly lower than the 
frequency of V region mutations (V versus promoter, P = 0.002; V 
versus C/n,, P = 0.03) and these mutations were not in AID hot 
spots, suggesting that these low-frequency mutations likely arose 
from PCR error (33). This is in contrast to the V region, where 
39% of the mutations in WT RMCE clones were in WRCY/RGYW 
AID hot spot motifs, similar to the original parental wild-type 
Ramos cells (29). 

Furthermore, Ramos RMCE clones showed characteristics of 
the mutations in the V regions similar to those of the parental 
wild-type Ramos cells in that -85 to 90% of the mutations were in 
G-C residues (see Table SI in the supplemental material). The 



TABLE 1 Mutations from WT and HS cell lines" 


Region 


Clone 


No. of unique mutations 


Total bp sequenced 


Frequency (10 4 ) 


Promoter 


WT A 


7 


63,392 


1.1 




WT B 


6 


75,278 


0.8 




HS A 


4 


63,958 


0.8 




HS B 


6 


45,846 


1.3 


VH 


WT A 


17 


58,166 


2.9 




WT B 


14 


61,372 


2.3 




HS A 


57 


62,288 


9.2 




HS B 


33 


54,960 


6.0 




WT A 


6 


46,848 


1.3 




WT B 


7 


58,560 


1.2 




HS A 


7 


53,436 


1.3 




HS B 


6 


43,920 


1.4 



" The promoter region was amplified using primers P3 and P4 and sequenced using primer P8. This region corresponds to a 566-bp region from the 2L lox site to the leader exon. 
The V region was amplified using primers P3 and P4 and sequenced using primer P7. This region corresponds to a 458-bp region from the leader intron to the end of JH6. The 
functional C/a was amplified using primers P5 and P6 and sequenced using primer P5. This region corresponds to a 732-bp region from the CHI exon to the CH2 intron. 
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FIG 4 Distribution of mutations in WT and HS clones. Distribution of total mutations in the 458-bp VDJ region of HS A and B clones compared to WT 
A. The red line indicates the boundary of the hot spot cluster. Some sequences were dominated by a single highly mutated site which was removed from 
the analysis and replaced by the blue line in order to display the results on the same y axis scale. The black boxes represent the WGCW motifs. In order 
to increase the number of mutations examined, additional sequences from WT A and HS A and B subclones not shown in the previous figures and Table 
1 were added to this figure. 



mutations also occurred in locations similar to those of non- 
RMCE-derived Ramos mutations, since the moving average of 
mutation frequencies (see Materials and Methods) has a correla- 
tion of 0.79 (P<2X 10" 16 ). 

Insertion of a cluster of hot spots into the endogenous heavy 
chain gene of Ramos cells increases the frequency of mutation 
throughout the V region. As in other cultured mouse and human 
B cell lines (32), the overall frequency of mutation in the V regions 
was 2 X 10" 4 to 4 X 10" 4 , which is only 2- to 3-fold greater than 
the PCR error (Table 1) and ~ 10-fold lower than the frequencies 
of mutation that are seen in vivo in germinal center B cells (2, 38). 
As a consequence, only a small percentage of randomly sequenced 
Ramos V regions contain new unique mutations (Fig. 3D), and in 
the V regions that have undergone SHM, the number of new mu- 
tations that accumulate even after 1 to 3 months, while orders of 
magnitude higher than that in non-Ig genes, is small compared to 
the PCR error (Fig. 3; Table 1). We therefore explored ways to 
increase the frequency of mutation in the Ramos cell line so as to 
make it more useful as a tool to explore the effect of changes in the 
local sequence environment and of czs-acting sequences on the 
distribution and characteristics of mutations in the V region and 
to generate better antibodies more quickly. 

An extension of the WRC motif, the WGCW motif, particu- 
larly the AGCT variant of that motif, has been demonstrated to be 
highly susceptible to AID mutation (9, 10, 39-43). The WGCW 
motif represents an overlapping hot spot (a hot spot on both the 
top and bottom strands) within which the adjacent C's on oppo- 
site strands could be targeted for mutation. This motif is enriched 
in mammalian switch regions (44) and has been identified as a site 
of focusing of mutations in vivo in the absence of repair (41-43). 
With the thought that a cluster of hot spots could act as an entry or 



activation site for AID, we engineered a cluster of 4 WGCW hot 
spots within the CDR2 of the Ramos IgH V gene to create 8 total 
hot spots within a 20-bp stretch on both the upper and lower 
strands (Fig. 3 A) without changing the amino acid sequence of the 
VH region. This V region with its cluster of HSs was introduced 
into two different Ramos subclones by RMCE (HS A and HS B in 
Fig. 3), and the frequency of mutation was determined by se- 
quencing (Fig. 3; Table 1). These HS cluster clones had a statisti- 
cally significant ~2- to 4-fold increase in the frequency of unique 
mutations compared to the WT RMCE controls (Fig. 3C; Table 1 ) 
(WT A/B versus HS A, P < 0.0001; WT A/B versus HS B, P = 
0.0009; HS A versus HS B, P < 0.0664). This was associated with a 
larger number of mutated V regions, and there were more V re- 
gions with greater than 1 mutation per V (Fig. 3D) (WT A/B 
versus HS A, P = 0.0002; WT A/B versus HS B, P = 0.0079). While 
the overall frequency of unique mutations in the V regions with 
the HS cluster was increased, the characteristics of the mutations 
in the V regions with the HS cluster resemble the WT in that -85 
to 90% of the mutations were in G-C residues (see Table SI in the 
supplemental material). While it was possible that the AID- 
induced mutations could have been concentrated in and around 
the HS cluster, this did not occur, and the mutations were distrib- 
uted throughout the V region in a pattern that was similar to that 
of the WT RMCE clones (Fig. 4; see also Fig. SI). We repeated the 
moving average analysis (see Materials and Methods) comparing 
the distribution of mutations in the HS cluster and WT V regions 
and again found a highly significant positive correlation (r = 0.52, 
P<2X 10~ 16 ). Thus, we have demonstrated that the insertion of 
a V region by RMCE in Ramos cells containing a cluster of over- 
lapping hot spots increases the frequency of mutation without 
changing their characteristics or distribution. 
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DISCUSSION 

When AID and Ig genes are introduced into non-B cells and B cell 
lines, the transfected Ig genes will undergo AID-induced muta- 
tions, especially if they are surrounded by Ig regulatory elements 
and if AID is overexpressed (29, 33). In addition, transfected 
non-Ig genes, including AID itself, can undergo AID-induced mu- 
tations in B and non-B cells (33). Perhaps one of the most useful 
examples of this was the generation of red fluorescent protein 
(RFP) molecules with new and useful properties when RFP was 
introduced into Ramos cells that were constitutively expressing 
AID (45). These transfected cell systems have made it possible to 
study some aspects of the regulation of AID targeting and to 
change the properties of Ig and non-Ig proteins. Studies with mu- 
tant mice and biochemical analysis have revealed some of the as- 
sociated proteins that contribute to higher mutation rates in Ig 
genes (46-50). Although some as-acting sequences have been 
shown to have a role in targeting AID to antibody V regions (15, 
51) and structure-function studies with AID have revealed the 
parts of the enzyme that are responsible for recognizing hot spot 
motifs (52-55), much remains to be learned about these aspects of 
the mutational process. 

To facilitate such studies, we have modified the Ramos human 
Burkitt's lymphoma cell line so that the structure of the IgH V 
regions and its immediate flanking sequences can be readily and 
repeatedly modified using RMCE. In this study, we have shown 
that once the endogenous VH and its flanking regulatory regions 
are replaced by the RMCE targeting lox-Hyg-TK-lox gene, mod- 
ified VH genes can be readily inserted in one copy in the proper 
orientation into the IgH locus. Ectopic insertions were not de- 
tected. In addition, we found that the replaced wild-type VH re- 
gions mutated at the same frequencies and with the same charac- 
teristics and distribution as did the endogenous Ramos VH 
regions. In addition to the various modifications of the endoge- 
nous Ramos 4-34 VH regions described here, two other human V 
regions have been introduced by RMCE and found to mutate with 
frequencies similar to those of the WT 4-34 V region. In order to 
use this system to affinity mature unrelated human antibodies, it 
will also be necessary to replace the L chain. While this could be 
done by establishing a separate RMCE system with different drug 
markers for the light chain, we have found that a simpler way to do 
this is to identify subclones of Ramos cells that have the IgH 
RMCE vector and have lost the ability to make the endogenous L 
chains. Transfected L chains in ectopic locations do undergo AID- 
induced mutations, thus allowing the mutation of both the H and 
L chains if that is required for affinity maturation (data not 
shown). 

A significant problem with using the Ramos system and other 
cultured B cell systems either to examine the mechanisms of SHM 
or to make better antibodies or other proteins is that the frequency 
of AID-induced mutations, while still high compared to the mu- 
tation rate throughout the genome, is still ~ 10-fold lower than 
that which is observed in vivo and only a fewfold higher than the 
PCR error rate. While this can be overcome to some degree by 
overexpressing AID, this also causes targeting of the C region and 
other highly expressed genes such as AID itself (33, 36), making it 
difficult to study the preferential targeting of AID to antibody V 
regions. Mutant AID molecules with increased activity increase 
the mutation rate but also tend to decrease the specific targeting 



and the viability of the host cell, presumably due to an increase in 
AID mutations in critical non-Ig genes (55). 

Here we have shown that we can avoid these problems by in- 
creasing the frequency of AID-induced mutations in the V region 
by introducing a cluster of WGCW AID hot spots that lead to 
overlapping hot spots in the CDR2 of the VH region. This in- 
creased the mutation rate 2- to 4-fold so that it begins to approach 
the in vivo frequency of mutation without significantly changing 
the overall targeting, characteristics, and distribution of muta- 
tions within the V region. We confirmed that the increase in mu- 
tation was restricted to the VH region that contained the cluster of 
hot spots by sequencing the endogenous L chain genes in Ramos 
cells (37) and showing that the frequencies of mutation in the L 
chains were similar in the WT and HS A and B clones (data not 
shown). The WGCW motif has been shown to be more highly 
targeted for mutation in vivo (9, 10, 39-43, 56). Because of the 
high frequency of mutations at such sites within the mouse JH4 
intron and S/jl region in the absence of repair (41-43, 57, 58) and 
in human V regions ( 10, 59), it has been suggested that it could be 
a potential site for the entry of AID (40, 42). Since AID is a very 
inefficient enzyme (60, 61), it is possible that AID accesses the V 
region elsewhere but is preferentially activated at this motif to 
initiate the process of mutation. Since AID can form multimeric 
complexes (62-64), it is also possible that it could preferentially 
associate with both strands of DNA where there are overlapping 
hot spots and promote multiple deamination reactions (40, 64). 
Whatever the explanation, it is surprising that there was not a local 
increase in mutations in and around the hot spot cluster. Of 1 1 
different subclones expressing the cluster of hot spots, only one 
revealed an increase of mutations in the region of the cluster (data 
not shown), and further experiments will be required to deter- 
mine the true frequency of such clones and the mechanism re- 
sponsible for the increased focusing of mutation with the hope 
that such studies will reveal new information on the targeting and 
activation of AID in vivo. While this work was in progress, RMCE 
was also used in mouse CHI 2 cells to study detailed sequence 
requirements for targeting AID to mouse Ig switch regions to 
mediate class switch recombination (26). These authors con- 
cluded that it was the overlapping nature of the WGCW motif, 
rather than the palindromic characteristics of some representa- 
tives of the motif, that increased the likelihood of mutations and 
subsequent double-stranded DNA breaks in switch regions. 

Here we present a novel platform for the modification of the 
immunoglobulin V region using RMCE in Ramos cells. We have 
shown here that inserting a cluster of overlapping hot spots in the 
H chain V region increases the rate of mutation without affecting 
the characteristics or distribution of those mutations, illustrating 
how this system will be useful in exploring the regulation and 
restriction of V region mutations. Since we can also insert new L 
chains and the Ramos cells have been shown to switch at a low 
frequency from IgM to IgG expression (65), this system will be 
useful for the affinity maturation of human monoclonal antibod- 
ies. 

MATERIALS AND METHODS 

Cell lines and culture conditions. The Ramos Burkitt's lymphoma cell 
line was obtained from Hilda Ye (Albert Einstein College of Medicine, 
Bronx, NY) and grown in Iscove's modified Dulbecco's medium (IMDM; 
BioWhittaker, Walkersville, MD) supplemented with 10% fetal bovine 
serum (Atlanta Biologicals, Lawrenceville, GA) and 100 U/ml penicillin- 
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streptomycin (Mediatech, Herndon, VA). Soft agar cloning was per- 
formed as previously described (30). An IgM + subclone of Ramos 6 (30), 
Ramos 6.25 subclone 8 (6.25), was used as the parental clone for the 
RMCE system. 

RMCE plasmids. Details of the construction of the targeting and re- 
placement constructs are located in the supplemental material (see 
Text SI). Briefly, for the targeting construct, the 5' and 3' homologous 
arms were constructed by PCR amplification from Ramos clone 6 
genomic DNA and the lox and Hyg-TK segments were cloned from the 
pCR4-TOPO-2L-Hyg-TK-L3 vector (Eric Bouhassira, Albert Einstein 
College of Medicine, Bronx, NY). Hygromycin-resistant IgM~ LC + 
Ramos cells transfected with the targeting construct were screened for 
integration of the targeting construct by PCR using the following primers: 
PI, 5' CTATGTGGCGAAAGGCAATCTATC 3', which binds upstream 
of the 5' arm, and P2, 5' GAACTCCATATATGGGCTATGAAC3', which 
binds within the CMV promoter of the Hyg-TK gene. For the replacement 
constructs, the pUC VDJ1 E/liS/u (WT A) replacement was created from a 
series of intermediate plasmids detailed in the supplemental materials (see 
Text SI). The pUC VDJ4-1 Eu.Su. (WT B) and pUC VDJ hot spot Eu.Su, 
(HS) plasmids were created by site-directed mutagenesis by PCR ampli- 
fication of pUC VDJ1 Eu.Su. (WT A) as described in the supplemental 
material (see Text SI). 

Transfection conditions. Ramos cells were electroporated with either 
2 jug of the "targeting construct" or 12.5 u.g of each "replacement con- 
struct" along with 3.2 u.g of Cre using the Nucleofector II (Lonza, Amaxa, 
Cologne, Germany) and Nucleofection (Lonza) kits. pBS 185 CMV-Cre 
was a gift from Eric Bouhassira (Albert Einstein College of Medicine, 
Bronx, NY) and was originally obtained from Clontech (Palo Alto, CA). 
Ramos RMCE clones were selected using 0.6 mg/ml of hygromycin (Cal- 
biochem). 

Fluorescence analysis and sorting. Ramos cells were stained with goat 
anti-human IgM-fluorescein isothiocyanate (FITC) (Southern Biotech- 
nology Associates, Birmingham, AL) and analyzed using the FACScan 
(Becton Dickinson, Franklin Lakes, NJ) cell sorter. For fluorescence- 
activated cell sorting experiments, at least 2 X 10 7 cells were used, stained 
as described above, and sorted using a FACSAria (Becton Dickinson) cell 
sorter. 

Southern blotting. Genomic DNA from hygromycin-resistant IgM~ 
LC + Ramos clones (Hyg-TK) was digested with NotI and Nsil, and the 
DNA fragments were separated on a 0.7% agarose gel, transferred to a 
Hybond N + nylon membrane (GE Healthcare, Amersham Biosciences, 
Piscataway, NT) in 20X SSC (3 M NaCl plus 300 mM sodium citrate), and 
UV cross-linked to the membrane. A Cu. probe containing the majority of 
exons 1 to 3 was generated by digesting the 3' homologous arm DNA with 
Sbfl and Hindi, labeled with [7- 32 P]CTP, hybridized to the blot at 65°C 
for 2 h, and visualized by autoradiography. Clones containing the target- 
ing vector correctly integrated at the functional Cu. allele produced a 
~6.4-kb band as well as a ~7.5-kb band that represented the nonfunc- 
tional Cu. allele. Parental clones RMCE 1 and RMCE 3 (Fig. IB and C) 
were thus identified and used in this study. No significant difference in 
mutation frequency was detected when either the RMCE 1 or the RMCE 3 
clone was used. 

PCR amplification, cloning, and sequencing. Genomic DNA was 
prepared as reported previously (30). The V and Cp, region from Ramos 
clones were amplified using Pfu Turbo Cx Hotstart polymerase from 
genomic DNA by using 30 cycles of 95°C for 30 s, 60°C for V region or 
66°C for Cu. for 30 s, and 72°C for 1 min. Primers for the V region were P3, 
5' TGTCTTCAGATCAGCAGCCTAAAG 3' (5' primer), and P4, 5' CAT 
TCTTACCTGAGGAGACGGTG 3' (3' primer), and for the C/u, region 
wereP5, 5' GCATCCGCCCCAACCCTTTTC 3 ' (5' primer), andP6, 5' G 
CGAATGCCAGACCCGAGTG 3' (3' primer). PCR products were mod- 
ified by A-tailing using Taq polymerase and cloned using the pGEM-T 
vector system. Minipreparations were prepared using the Montage Plas- 
mid Miniprep HTS 96 kit (Millipore, Billerica, MA) and sequenced using 
the ABI 3730 DNA analyzer (Applied Biosystems, Carlsbad, CA). Primers 



used for sequencing were the following: V region, P7, 5' GCACAAGAAC 
ATGAAACACC 3'; IgH promoter region, P8, 5' GGACCCCTGTGAAC 
AGAAAA 3'; Cu,, P5, as described above. The functional Cu. allele was 
distinguished from the nonfunctional allele as described previously (37). 
DNA was analyzed using SeqMan (DNAStar, Madison, WI), and statisti- 
cal analysis was performed using a chi-square test. 

Extraction of RNA and quantitative real-time RT-PCR. RNA was 
extracted as reported previously (30). Five micrograms of RNA was re- 
verse transcribed using the Superscript III first-strand synthesis system 
(Invitrogen) . Quantitative real-time PCR of cDNA was performed in trip- 
licate using the DNA Engine Opticon 2 cycler (MJ Research, Waltham, 
MA) and QuantiTect SYBR Green (Qiagen) by using 40 cycles of 94°C for 
15 s, 60°C for 30 s, and 72°C for 30 s. The following primers were used: 
IgM, 5' GACACGTCCAAGAAACAGC 3' (5' primer) and 5' GCCCTAG 
TAATAACTCTCGC 3' (3' primer); AID, 5' CTTCGCAATAAGAACGG 
CTG3' (5' primer) and 5' GAGGTGAACCAGGTGACGC 3 ' (3' primer); 
glyceraldehyde-3-phosphate dehydrogenase (GAPDH), 5' CTGCGACC 
GCCCCCGAACCG 3' (5' primer) and 5' TACCCTGCCCCCATACGAC 
TGCAAAGACC 3' (3' primer). Data were analyzed using the Opticon 
Monitor 3 program (MJ Research), and relative fold changes in cDNA 
levels were calculated using the threshold cycle (2~ AACr ) method with the 
GAPDH reference gene as normalization (66). 

Moving average analysis. Because the mutation frequency in Ramos 
cells is relatively low, a direct site-by-site comparison of mutation fre- 
quencies is inappropriate. We therefore compared the overall patterns of 
unique mutation frequencies using a moving average, i.e., the mean mu- 
tation frequency within a sliding window of length 100 along the length of 
the sequence. In each case, we also tested with shorter window lengths of 
80 and 60 with similar results. We then calculated the correlation between 
the two moving averages. When comparing data sets with nonidentical 
germ line sequences, we excluded any sites that were not shared between 
the two germ line sequences. 
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